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1 Advanced Computer Architecture 


(a) 


The Branch Target Buffer (BTB) stores information about previously encoun- 
tered branch instructions. A simple design would store the entire address of the 
branch instruction and an entire target address. How could the number of bits 
stored in each entry of the BTB be significantly reduced and what trade-offs are 
involved? [4 marks] 


The BTB and branch predictor are normally accessed on every clock cycle 
regardless of whether the instruction being fetched is a branch. Assuming 
a scalar processor, outline one idea that could be used to help reduce these 
unnecessary accesses? [4 marks] 


Modern superscalar processors are able to support hundreds of instructions 
“in-flight” at the same time and schedule instructions dynamically (i.e. support 
out-of-order execution). What advantages does dynamic scheduling offer when 
compared to an in-order superscalar processor? [4 marks] 


Some processors convert short-forward branches, i.e. those that branch over a 
few instructions, to an instruction that sets a predicate register followed by a 
short sequence of instructions that are conditionally executed depending on the 
value of the predicate. 


(1) Why might such a scheme perform better than simply relying on branch 
prediction and what are its limitations? [4 marks] 


(42) Given a superscalar processor that supports out-of-order execution, briefly 
outline what changes to the processor would be required to support such a 
scheme. [4 marks] 
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2 Bioinformatics 


(a) 


Calculate the dynamic programming matrix and one or more optimal align- 
ment(s) for the sequences GAATTC and GATTA, scoring +2 for a match, —1 for a 
mismatch and with a linear gap penalty of d = 2. [5 marks] 


Determine whether the RNA string GGACCACCAGG should be folded into two 
substructures. [7 marks] 


Discuss how to carry out the multiple sequence alignment of the following three 
sequences: TTTTAAAA, AAAACCCC, CCCCTTTT. [4 marks] 


Discuss which steps of the 1994 Adleman’s DNA computation approach would 
particularly affect the scalability of the number of the visited cities. [4 marks] 
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3 Cryptography 


Your colleagues need a pseudo-random permutation Px : Zygs <* Zyos, over the 
integers in the range 0 to 999999, where K is a 128-bit key. The standard 
library of their development environment offers them only a 128-bit pseudo-random 
permutation, in form of the blockcipher AES-128. 


(a) 


Recalling that 2° = 1.048576 x 10°, they first decide that implementing a 20-bit 
pseudo-random permutation Tx : {0,1}?° © {0,1}? might get them closer to a 
solution. How could they implement 7 using the available AES, function? 

[4 marks] 


One of your colleagues then proposes to use the function 
Pie(m) := (Tc((m)29))* mod 10° 
as a “good enough” approximation of what is required. 


Notation: (+)n : Zon — {0,1}”" encodes non-negative integers as n-bit bitstrings 
and (-)~! : {0,1}* — N does the opposite, i.e. ((i),)~! = 7 for all 0 <i < 2”. 


Propose a distinguisher D that can distinguish P;, from a random permutation 
R: Zio < Zyos using not more than 5000 oracle queries, and show that it 
achieves |P(DP«O = 1)—P(DRO = 1)| > 4 averaged over all K. [6 marks] 


Another colleague then proposes the following algorithm: 


function Px(m): 
c:= Tx ((m)20) 
mee? 
while m> 10°: 
C= Tx(c) 
jae 
return m 


Show that this is in fact a permutation by 
(1) explaining why this algorithm always terminates; [1 mark] 
(ii) providing an implementation of the inverse P;-'(m). [3 marks] 


What side-channel risk could the algorithm for Px (m) from part (c) pose, and 
what can an observer learn from it? [2 marks] 


Propose an alternative algorithm that reduces the risk that an observer can learn 
anything from this type of side channel to a negligible probability. [4 marks] 
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4 Denotational Semantics 


Define © to be the flat domain { T }_. 


(a) Let P be a poset with partial order CE and let S be a subset of P. 


Define M(S) to be the property of S given by Vr ES Vye Parly > yeS. 


Prove that M(S) holds if, and only if, there exists a monotone function 
f:P— S such that f-{T}=S. [8 marks] 


(b) Let D bea cpo. 


(1) For subsets S' of D, define a property C(S) in terms of the cpo structure of 
D such that 


(+) C(S) holds if, and only if, there exists a continuous function 
f:D—®™ such that f-'{T} = 8. 
[4 marks] 


(12) Prove (ft) above. [8 marks] 


5 E-Commerce 


Due to recent changes in the market it is felt that there is an opportunity to build a 
new social networking platform. 


(a) Describe two business models, other than the traditional advertising supported 
model, along with their implications for the social network in terms of product 


development and scalability. [6 marks] 


(b) What legal considerations should you keep in mind if you were to develop an 
advertising-based business model? [4 marks] 


(c) Outline a marketing plan for an advertising-based social network that will grow 
the network’s revenue to the breakeven point. [10 marks] 
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6 Hoare Logic and Model Checking 


Consider a programming language with commands C’ consisting of the skip no-op 
command, sequential composition C,;C2, loops while B do C’ for Boolean ex- 
pressions B, conditionals if B then C; else Cy, assigment X := E for program 
variables X and arithmetic expressions EF, heap allocation X := alloc(F),...,E,), 
heap assignment [F,] := E>», heap dereference X := [LE], and heap location 
disposal dispose(F). Assume null = 0, and predicates for lists and partial lists: 


list (t, []) = (f = null) A emp 

list(t,h :: a) = dy.(tH h) « ((t+1) 6 y) * list(y, a) 

plist (t,, |], t2) = (t: = te) A emp 

plist(t),h :: a,te) = dy. (ti Hh) * ((t1 +1) & y) x plist(y, a, te) 


In the following, all triples are linear separation logic triples. 


(a) Explain why acommand C of your choice satisfies the following triple, or explain 
why no such C exists: {null}> 5} C {T}. [2 marks] 


(b) Explain why acommand C of your choice satisfies the following triple (i.e. moves 
v to a different location): {r HUAX =x} C{Y HvAY Fa}. [2 marks] 


(c) Give a loop invariant that would serve to prove the following triple, for a 
command that creates a reversed copy of a list (no proof outline required). 
{list(X, a) } 

Y := null; C := X; 
while C ~ null do (V := [C]; Y := alloc(V,Y); C := [C+1]) 
{list(X, a) * list(Y, rev a)} [4 marks] 


(d) Adjust the program in (c) with a new loop body Cz, so it (still) terminates and 
{list(X,a)} Y := null; C := X; while C ¥ null do C, {list(Y,rev a)} 
holds (no proof, loop invariant, or termination argument required). {2 marks] 


(e) Consider an unsound extension of the separation-logic proof system with the 
rule {E; > 0A emp} alloc here(F), £2) {F, 1 E,} for a new command 
alloc_here(F), F2). Explain in detail, with reference to the proof rules, how 
{emp} C {1} is derivable, for a non-looping C of your choice. [4 marks] 


(f) Give a loop invariant that would serve to prove the following triple, for a 
command that creates a list of the Fibonacci numbers up to n (no proof outline 
required). Assume fibs(i, 7) = [fib 7,..., fib j] for 7 < 7 and [] otherwise. 

{emp \(N=nAn> 2)} 


II := alloc(i,null); I := alloc(0,II); X := 1; C := 2; 
; IV := [I]; IIV := [TI]; I := II; 
< 
while GSN 4 \ar := alloc(IV+IIV,null); [T+1] := IT; ¢ := ay) 
{list(X, fibs(0, 7)) } [6 marks] 
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7 Information Theory 


(a) Describe the concepts of discrete entropy and mutual information and how they 
relate to each other. [4 marks] 


(b) How does your answer to part (a) change when the system is continuous rather 
than discrete? 2 marks 


(c) How do entropy and mutual information relate to the capacity of a noisy 
channel? 3 marks 


(d) Consider a noisy binary channel with input X and output Y. Under what 
circumstances is H(Y |X) independent of the distribution of X? 3 marks 


(e) A noisy binary channel is modeled as shown in the diagram below: 


0 0 
f 

? 
f 

1 1 


The probability of a bit being flipped is f. The probability of a bit being erased 
is e. Derive the capacity of this channel and the probability distribution of the 
input bits that achieves it. [8 marks] 


You may use the following equality without proof: 


H(a,1—a-—b) —alog,(a) — (1 — a — 6) log,(1 — a — b) 


a a 


a-9H (-p1- 4] — (1 — 6) log,(1 — b) 
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8 Machine Learning and Bayesian Inference 


(a) 


(b) 


State the central limit theorem for a sequence X1, X9,..., Xn of independently 
and identically distributed (iid) random variables having mean E(X) = yp and 
variance var(X) = 0°. [2 marks] 


Explain how the central limit theorem can be used to provide a two-sided 
confidence interval for the estimate of the mean of a random variable. 
[4 marks] 


When X € {0,1}, explain how an estimate of E(X) can be used to obtain an 
estimate of var(X). [2 marks] 


We know that if X is normal distributed with mean 0 and variance 1 then 
Pr(—1.96 < X < 1.96) > 0.95. You have trained a classifier using algorithm A 
on a data set and tested it using 1000 test examples. You obtain 57 errors. Find 
a two-sided 95% confidence interval for the accuracy. [3 marks] 


You have a trained second classifier using algorithm B and the same data set 
as in Part (d), which, when tested using the same 1000 test examples, gives 55 
errors. Denoting by a the actual accuracy of the first classifier, and by a’ the 
actual accuracy of the second classifier, find a two-sided 95% confidence interval 
for the difference (a — a’) in the accuracies of the two classifiers. State any new 
assumptions that you make. [4 marks] 


Your boss has a vested interest in arguing that algorithm B is better than 
algorithm A for the problem of interest, and argues that the measured accuracies 
confirm this view. Discuss the pros and cons of this conclusion, and suggest how 
you might conduct further experiments to help your boss. [5 marks] 
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9 Optimising Compilers 


The following excerpt from a program in C-style code is optimised by a compiler 
using data-flow analyses and transformations. Assume that variables x, y and z have 
already been defined: 


a=x-y 

if a. > 3) 
b=at+z 
c=x-y 

} else { 
b=at+z 
a=arx*b 

i 

d=x-y 

b= be fd 


print(a * b) 


Using available expression analysis, perform common subexpression elimination 
on the code showing the results of both the analysis and transformation. 
[5 marks] 


Using very busy expression analysis, perform code hoisting on the code from 
part (a) showing the results of both the analysis and transformation. 
[4 marks] 


Using reaching definition analysis, perform copy propagation on the code from 
part (b) showing the results of both the analysis and transformation. [Hint: use 
the results of the analysis to transform across basic blocks.] [4 marks] 


Using live variable analysis, perform dead code elimination on the code from 
part (c) showing the results of both the analysis and transformation. 


[4 marks] 


Perform if simplification on the code from part (d) showing the result of the 
transformation. [3 marks] 
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10 Principles of Communications 


(a) A mobile host moves while transmitting packets via IP multicast. What effect 
will this have on multicast routing algorithms? [10 marks} 


(b) A mobile host is seeing a high rate of packet loss. How will this affect the 


throughput seen by a TCP connection that it has to some fixed server? 
[10 marks} 
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11 Quantum Computing 


(a) What problem does Grover’s search algorithm tackle, and what is its advantage 


(b) 


over the best classical algorithm for this task? [2 marks] 


Let there be a database containing 32 elements, indexed by the binary numbers 
00000 to 11111. A single element 00110 is marked. 


(i) Give an oracle circuit that identifies the marked element. [1 mark] 


(ii) If Grover’s search algorithm is applied to find the marked element, what 
should the initial state be set to, and what is the state after a single Grover 
iterate has been applied? [4 marks] 


(117) To find the marked element with maximum probability requires N iterates 
in total. What is the value of N, and what is the probability of correctly 
finding the marked element? [4 marks] 


(iv) If the algorithm is instead run with 3N iterates in total, what is the 
probability of correctly finding the marked element? Comment on your 
answer. [2 marks] 


Let V be an oracle circuit that marks one or more elements, acting as follows: 


V(\x)|@)) = |x)la ® f(x) 


Here a takes the values 0 or 1, and we have f(x) = 1 when z is the index of 
a marked element, and f(x) = 0 otherwise. How could V be altered to allow 
Grover’s search to find an unmarked element? [2 marks] 


(d) A Grover iterate consists of the oracle circuit, typically denoted V, followed by 


a circuit W: 
(1) What is the function of W? [1 mark] 
(ii) What would happen if the order of V and W were swapped, such that 


Grover’s algorithm is run with V following W as the Grover iterate? 
[4 marks] 
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12 Randomised Algorithms 
In this question, all considered graphs are undirected and d-regular. 
(a) State the definition of conductance. [2 marks] 


(b) If G is disconnected, what does this imply in terms of the conductance? 
[1 mark] 


(c) If Gis disconnected, what does this imply in terms of the spectrum of L? Briefly 
justify your claim. [3 marks] 


(d) The d-dimensional hypercube with n = 2% vertices is defined by creating a 
vertex for each d-digit binary number (21, %2,...,%a) € {0,1}¢. Further, any 
two vertices are adjacent if and only if their binary representations differ in 
exactly one digit. 


(i) Identifying each binary representation (1,22,...,%a) € {0,1}¢ with a 
unique vertex label in {1,2,..:,n}, verify that ft. s...422) = (—1)™ is 
an eigenvector of the Laplacian Matrix L of the hypercube. State the 
associated eigenvalue of f for both L and the adjacency matrix A. 

[7 marks] 


(it) Apply the Spectral Clustering Algorithm to estimate the conductance of 
the hypercube, assuming that f in (d)(z) is the eigenvector of A2. [Hint: It 
suffices to apply the (n/2)-th sweep cut only.| [5 marks] 


(iii) Combining (d)(7) and (d)(2), what can you conclude about the found cut? 
[2 marks] 
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13. Types 


Consider Gédel’s T, the simply-typed lambda calculus with function and natural 
number types, with zero, successor and iterator term formers for the natural number 
type. 


(a) Define a logical relation suitable for establishing the termination of closed 
programs in this language. 5 marks 


(b) State the fundamental lemma for this language. 3 marks 


(c) State formally what it means for a set of terms X to be “closed under reduction”. 
2 marks 


(d) Prove the fundamental lemma holds for the iterator case. [10 marks 


END OF PAPER 
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